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The subsystems of the CMS silicon strip tracker were integrated and commissioned at the Tracker 
Integration Facility (TIF) in the period from November 2006 to July 2007. As part of the commission- 
ing, large samples of cosmic ray data were recorded under various running conditions in the absence 
of a magnetic field. Cosmic rays detected by scintillation counters were used to trigger the readout of 
up to 15 % of the final silicon strip detector, and over 4.7 million events were recorded. This docu- 
ment describes the cosmic track reconstruction and presents results on the performance of track and 
hit reconstruction as from dedicated analyses. 



1 Introduction 



The CMS tracking system, composed of silicon pixel and micro-strip detectors, is designed to provide a precise 
and efficient measurement of the trajectories of charged particles emerging from the LHC collisions. With over 70 
million electronic channels and an active area of about 200 it is the largest silicon tracker ever built. 

First experience with tracker operations and track reconstruction was gained during summer 2006, when elements 
of the silicon strip tracker were operated at room temperature in a comprehensive slice test involving various 
CMS subdetectors. The tracker setup was limited and represented only 1% of the total electronic channels and an 
active area of 0.75 m^. Cosmic rays detected in the muon chambers were used to trigger the readout of all CMS 
subdetectors. The CMS superconducting solenoid provided a magnetic field of up to 4T. Over 25 million events 
were recorded, and the tracking performance was studied using both a dedicated algorithm for cosmic ray tracking 
and a general algorithm for track reconstruction in LHC collisions. In addition, tracks reconstructed in the silicon 
strip tracker were compared with tracks detected by the muon chambers. The results are summarized in Ref. |[T|. 

In the period from November 2006 to July 2007 the different subsystems of the silicon strip tracker were inte- 
grated and commissioned in a large clean room at CERN, the Tracker Integration Facility (TIF). As part of the 
commissioning large samples of cosmic ray data were recorded under different running conditions. No magnetic 
field was present, and the tracker setup consisted of up to 15% of the electronic channels. Over 4.7 million events 
were taken while operating the detector at five different temperature points. The data were used to verify the re- 
construction and calibration algorithms for low- and high-level objects and for comparison with simulated events. 
The tracking performance was studied using a dedicated cosmic track reconstruction algorithm and two standard 
tracking algorithms for LHC collisions modified for cosmic ray tracking. 

The results from tracker commissioning at the TIF are summarized in three publications. The operational aspects, 
commissioning studies and simulation tuning are described in Ref. The alignment of the silicon strip tracker 
using cosmic tracks, survey information and a laser alignment system are described in Ref. lO. In this paper the 
results of the track reconstruction are reported. 

The setup used for cosmic ray reconstruction is described in Section|2j followed by an overview of the data sets and 
the Monte Carlo simulation in Section[3] A brief introduction to the local reconstruction is presented in Section]?] 
The tracking algorithms are summarized in Section |5] Track performance results are presented in Section ]6] The 
validation of the track reconstruction is the subject of Section ]7] The hit performance studies related to tracking 
are described in Section]8]and conclusions are drawn in Sectionlo] 

2 Experimental Setup 

The CMS tracker occupies a cylindrical volume around the interaction point with a length of 5.8 m and a diameter 
of 2.5 m. The region closest to the interaction point is equipped with a pixel system, while the bulk consists of 
layers of silicon strip detectors. A schematic overview of the CMS tracker is shown in Figure]!] Throughout this 
note the standard CMS reference system is used. It has its origin in the center of the detector, with the z-axis along 
the beam line in the anti-clockwise direction for an observer standing in the middle of the LHC ring. The a:-axis 
points to the LHC center and the y-axis points upward. The azimuthal angle </> is measured starting from the a:-axis 
towards the y-axis. The polar radius r is defined as the distance from the z axis in the transverse x-y plane. 

The pixel detector is the innermost part of the tracking system. Three cylindrical layers of pixel detector modules 
are complemented by two disks of pixel modules on each side. The strip detector surrounds the pixel detector and 
is composed of four subsystems. The central region, up to a pseudorapidity of \ri\ « 1, is covered by the Tracker 
Inner Barrel (TIB) and the Tracker Outer Barrel (TOB). At each side of the TIB the remaining volume inside the 
TOB is filled by the Tracker Inner Disks (TID). The siUcon strip system is completed by two Tracker End Caps 
(TEC), extending the acceptance of the tracker up to a pseudorapidity of |r/| < 2.5. 

The TIB is composed of four layers using 320 /im thick silicon micro-strip sensors. The strip pitch is 80 /xm on 
layers 1 and 2 and 120 /im on layers 3 and 4. The TID consists of three disks on each side, also employing 320 /im 
thick silicon micro-strip sensors. Its mean pitch varies between 100 /im and 141 /im. The TOB encompasses the 
TIB/TID and consists of six layers of 500 thick sensors with strip pitches of 183 /im on the first four layers and 
122 /im on layers 5 and 6. The TEC is composed of nine disks on each side, carrying up to seven rings of silicon 
micro-strip detectors. The sensor thickness is 320 /im on the inner four rings and 500 /im on the outer three rings, 
and the mean pitch varies from 97 /im to 184 /im. 

All silicon strip subsystems are equipped with ref) modules. These modules have their strips parallel to the beam 
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axis in the barrel and radial on the disks. In addition, the modules in the first two layers and rings of TIB, TID and 
TOB as well as rings 1, 2 and 5 in the TECs carry a second micro-strip detector module, generally referred to as 
stereo module. The stereo modules are mounted back-to-back to the r(p modules with a stereo angle of lOOmrad, 
resulting in a measurement of z in the barrel and r on the disks. A detailed description of the CMS tracker can be 
found inRef. E). 



-1.7 






~~ 1200 


-1.9 ^ 




1000 


-2.1 ^ 




800 


-2.3 _^ 




-25 BOO 




400 


r (mm) 


200 









-200 




-400 




-BOO 




-800 




-1000 




-1200 



-0.9 -0.7 -0.5 -0.3 -0.1 0.1 0.3 0.5 
\ \ \ \ I I / / 



I I I I I I 

I I ll I III ll I ll 

II II II 



TEC- 



I I 



II 



'll 'll 'll 'll 'll 'll 'll 'll 



4 



I I 
I I 



PIXEL 



m 



II, I, I, II, II, I, 



I 



II 'II 



I, I I 

TEC+ 



II II II 

I '11 'IK 

II, II, II, II, II, I II, II, 



-2600 -2200 -1800 -1400 -1000 -600 -200 200 600 1000 1400 1800 2200 2600 

z (mm) ; 



Figure 1: Schematic layout of the CMS tracker Each line represents a detector module. 

Although all components of the silicon strip tracker were commissioned at the TIF, limitations in terms of space and 
cost and constraints from the data acquisition and cooling systems allowed only a partial readout of the detector 
The slice of the tracker operated at the TIF represented all four silicon strip subdetectors (TIB, TID, TOB, TEC) 
containing a total of 2161 modules. The pixel detector was not present. The majority of the readout modules were 
located in a sector defined as z > and ?/ > as shown on the left side of Figure |2] With about 1.3 million 
electronic channels, the tracker setup consisted of nearly 15 % of the final silicon strip setup. The active area 
amounted to 24.75 m^. The TIF layout is briefly summarized in Table [l] More information can be found in 
Ref. 121. 

Cosmic muon triggering was provided by scintillation counters mounted above and below the tracker A trigger 
signal was generated based on the coincidence of any top with any bottom scintillation counter Data were recorded 
in various trigger layouts, which are shown in Figure [2] Trigger configuration A was expected to primarily result 
in TIBh-TOB tracks. Trigger configuration B was chosen to select tracks passing closer to the nominal interaction 
point. Additional availability of scintillation counters allowed for extended coverage of the trigger system and to 
combine trigger configurations A and B to form configuration C. 

A lead plate with a thickness of 5 cm was located on top of the lower scintillation counters to avoid triggering on 
very low momentum tracks. This translates into a minimum cosmic energy of 200 MeV for the trigger system. 

Table 1 : Overview of the various silicon strip systems participating in the TIF cosmic data taking. 



SiUcon Strip Subdetector 


Number of Modules 


Percentage of Final System 


TIB 


437 


16% 


TID 


204 


25% 


TOB 


720 


14% 


TEC 


800 


13% 
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Figure 2: Layout of the various trigger scintillator positions used during the cosmic data taking at the TIF (in 
chronological order): (a) trigger configuration A; (b) trigger configuration B; (c) trigger configuration C. The x-y 
view is shown on the left side, the r-z view is shown on the right. The straight lines connecting the active areas of 
the top and bottom scintillation counters indicate the acceptance region. In the x-y view, the active TOB modules 
are shown in different shading, the active TIB area is framed in black and the active part of the TEC is indicated 
by the radial lines. 
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3 Data Sets and Monte Carlo Simulation 



3.1 Data Samples 

A total of over 4.7 million cosmic events were recorded during the tracker commissioning at the TIF in the period 
from March to July 2007. The recorded data are grouped into different data sets, defined by active detector, trigger 
setup and operating temperature. An overview of the nine different data sets is presented in Table|2] 

Depending on the configuration of the readout electronics the raw silicon strip detector data was written in one of 
two formats: 

1 . Zero Suppression: standard operation used for proton-proton collisions (reordering to physical channel order, 
subtraction of strip pedestals and common-mode noise). 

2. Virgin Raw Data: used for testing, commissioning and calibration (no reordering or pedestal/common-mode 
subtraction is performed). 

Each run was checked using online and offline data quality monitoring tools. If a run did not meet the quality 
requirements, it was flagged as bad. For example, the detector readout had to be well synchronized with the 
passage of cosmic muons in order to guarantee an optimal signal-to-noise ratio. In addition, results from global 
reconstruction were used to flag bad runs. A detailed description of the different good run selection criteria can be 
found in Ref. ||2l. Only events from good runs are used for offline analyses, reducing the total usable data set from 
over 4.7 million events to about 4.2 million events. 

The complete data set was processed several times using different software releases to integrate improvements 
in track reconstruction. The results reported in this note were obtained using the two most recent reconstruction 
passes. 

The data sample taken at — 15°C requires specific attention. Due to constraints from the cooling facility the 
number of powered modules had to be drastically reduced in all silicon strip sub-detectors. Although tracking is 
possible in this sample, the reconstructed tracks suffer from the hmited setup. 



Table 2: Overview of the TIF data samples used for the tracking analysis. 



Active Detector 


Trigger Configuration 


T [° C] 


Total Events 


Good Events 


TIBh-TIDh-TOB 


A 


+15 


703 996 


665409 


TIBh-TIDh-TOBh-TEC 


A 


+15 


191 154 


189925 


TIBh-TIDh-TOBh-TEC 


B 


+ 15 


193 337 


177 768 


TIBh-TIDh-TOBh-TEC 


C 


+15 


244 450 


241512 


TIBh-TIDh-TOBh-TEC 


C 


+10 


992 997 


534759 


TIB+TID+TOB+TEC 


C 


-1 


893 474 


886 801 


TIB+TID+TOB+TEC 


C 


-10 


923 571 


902 881 


TIB+TID+TOB+TEC 


C 


-15 


656 923 


655 301 


TIB+TID+TOB+TEC 


C 


+ 15 


112139 


112134 



3.2 Simulation 

The simulation of a realistic cosmic muon spectrum relies on dedicated parametrizations of the energy dependence 
and incident angle, also accounting for the correlations between the two quantities. For comparisons with TIF 
data, cosmic muons have been generated starting from an ideal cylinder coincident with the CMS surface using the 
CMSCGEN generator |5|. The parametrizations have been adopted from L3CGEN |6|, a cosmic muon generator 
developed for the L3+Cosmics experiment. The original parametrization of the spectrum is based on a polynomial 
fitted to data above an energy of lOGeV and cannot be used at much lower energies. In order to cover the 
range from 200 MeV to 2 Ge V the cosmic muon spectrum is produced assuming a simple energy dependence in 
CMSCGEN and reweighted using the CAPRICE |7| energy spectrum. Cosmic muons were generated with angles 
up to 88 deg from the vertical axis and in a time window of 25 ns. Since the tracker is on the surface without any 
iron shielding from muon stations outside, the energy loss scale factor was set equal to zero. 

Before the detector and electronics response are simulated, a special filter is applied reproducing the trigger setup. 
The scintillation counters are modeled as virtual 1 x 1 m^ surfaces, and the muon trajectories are extrapolated 
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to the outside of the tracker where the intersection points with the scintillator surface are calculated. Using the 
intersection points, the trigger logic is applied and the cosmic muon event is either retained or discarded. Simulated 
events are generated separately for all three trigger layouts. Further details about the simulation as well as angular 
and momentum distributions can be found in Ref. 

4 Local Reconstruction 

The local reconstruction is done in two consecutive stages, transforming the digitized information from the silicon 
strips into reconstructed hits in the local coordinate of the silicon sensors. The resulting reconstructed hits are input 
to the various track reconstruction algorithms which are described in Section|5] 

The raw data coming from the readout electronics of the silicon strip detectors are unpacked and grouped according 
to detector modules. After the unpacking step, the raw data are commonly referred to as digis, which denotes 
pedestal-subtracted and zero suppressed ADC counts for individual strips. The digis are associated with a detector 
ID, a unique number which encodes the location of each module in the mechanical structure of the CMS tracker. 
Using the digis, the local reconstruction is performed in stages: cluster reconstruction and hit conversion. 

1. Cluster reconstruction 

The cluster reconstruction groups adjacent strips whose associated charge pass a set of thresholds. The 
thresholds depend on the noise levels characterizing the strips of the cluster Clusters are reconstructed by 
searching for a seed strip with a signal-to-noise ratio (S/N) greater than 3. Neighboring strips are attached 
to the cluster if their signal-to-noise ratio exceeds 2. The total signal size of the cluster must exceed five 
times the quadratic sum of the individual strip noises. The cluster reconstruction algorithm is referred to as 
3-Threshold algorithm. The signal of a cluster is based on the sum of the ADC counts of all associated strips. 
In the most recent reconstruction pass a correction for the variations in the electronic gain was applied. These 
corrections had been derived from the height of digital synchronization signals Q. 

2. Hit conversion 

The hit conversion associates every cluster with a hit position and corresponding errors. The hit position is 
determined from the centroid of the signal heights. The position resolution is parametrized as a quadratic 
function of the projected track width on the sensor in the plane perpendicular to the strips ID. Deviations 
from the ideal geometry ("misalignment") are taken into account by adding an additional uncertainty on the 
module positions (Alignment Position Error, APE) to the hit errors. The size of the APEs was estimated 
from survey data |[3]. For the most recent reconstruction pass a first set of alignment constants was available, 
and the APEs were reduced to about 150 /im in the TOB and 500 - 600 /im in the other subdetectors. 

Details on the performance of the local reconstruction can be found in Ref. 

5 Track Reconstruction 

Three tracking algorithms were applied to TIF data: the two standard algorithms designed for the reconstruction 
of proton-proton collisions ("Combinatorial Kalman Filter" and "Road Search") and one specialized algorithm for 
the reconstruction of single track cosmic events ("Cosmic Track Finder"). They use the hits described in Section|4] 
The position estimates may depend on the local track angles. In addition, a reconstruction geometry describing the 
location of the modules and the distribution of passive material and condition information about the status of the 
different modules are needed. 

All three algorithms decompose the task of track reconstruction into three stages: 

1. seed finding, which provides a selection of initial hits and a first estimate of parameters, 

2. pattern recognition, which associates hits to a track, and 

3. track fitting, which determines the best estimate of the track parameters. 

The first two items are specific to each of the algorithms while the track fit is always performed by a Kalman 
filter and smoother. All these software modules use some common services. In the absence of a magnetic field 
the tracks are extrapolated as straight lines. Material effects - energy loss and multiple Coulomb scattering - are 
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estimated each time a track crosses a detector layer. The amount of material at normal incidence is obtained via 
the reconstruction geometry, and the same constants as for the reconstruction of proton-proton collisions are used. 
Since the momentum is not measured at the TIF, a constant value of 1 GeV/c is assigned, close to the expected 
average momentum of cosmic muons. 

The three track reconstruction modules have been designed (Cosmic Track Finder) or configured (Combinatorial 
Kalman Filter, Road Search) for the reconstruction of single track cosmic events. They were not optimized for 
the reconstruction of cosmic showers. Consequently, large multiplicity events with more than 300 reconstructed 
clusters were excluded from the track reconstruction. The algorithms and the fitting procedure are described in the 
following subsections. 

5.1 Combinatorial Kalman Filter 

The Combinatorial Kalman Filter (CKF) uses the capacity of the Kalman Filter |8 1 for simultaneous pattern recog- 
nition and track fitting. Starting from an initial estimate of the track parameters the algorithm iterates through the 
layers of the tracker and builds a combinatorial tree of track candidates. The CKF is identical to the one designed 
for proton-proton collisions except for the seed finding stage, which has been adapted for the reconstruction of 
cosmic tracks. 

5.1.1 Seed Finding 

In the standard tracking, i.e., for particles coming from the interaction point, seeds are created in the innermost 
layers of the tracking system. A seed is made out of a hit pair and a loose beamspot constraint or out of a hit triplet. 
The starting parameters of the trajectory are calculated from a helix passing through the three points. The selected 
hits must be pointing towards the interaction point and a minimum transverse momentum cut is applied. 

The situation for cosmic track reconstruction is very different with respect to proton-proton collisions, in particular: 

• No vertex constraint can be applied, since the cosmics do not necessarily cross the tracker pointing towards 
the interaction point. 

• Seeds should be created also in the outer layers, because these layers have a higher acceptance for cosmic 
tracks. 

Hence, the seed finding algorithm had to be modified to handle cosmic track reconstruction. In addition, the mod- 
ified version can be used for tracking of beam halo muons. For the reconstruction of TIF data, hit triplets are used 
in the inner or outer parts of the barrel and hit pairs in the endcaps. Hit triplets are checked for compatibility with 
a straight line; the radius of the circle passing through the three hits has to exceed 5 m. The different combinations 
of layers used for seeding are summarized in Table |3] 



Table 3: Combination of layers used for seed finding in the CKF algorithm. 



Seeds 


Layers 


Inner barrel triplets 
Outer barrel triplets 

Endcap pairs 


TIB1H-TIB2H-TIB3 
TOB4H-TOB5H-TOB6, TOB3h-TOB5h-TOB6, TOB3h-TOB4h-TOB5, 
TOB3H-TOB4H-TOB6, TOB2h-TOB4h-TOB5, TOB2h-TOB4h-TOB6 
Any pair of adjacent TEC wheels 



5.1.2 Pattern Recognition 

The computationally most time consuming part of track reconstruction is the pattern recognition, that is, the build- 
ing of a candidate trajectory by selecting its hits out of all the hits in the event. 

From each seed a propagation to the next surface is attempted. Hits are identified in a window whose width is 
related to the precision of the track parameters. If a hit is found within the window, it is added to the candidate 
trajectory and the track parameters are updated. The way in which the trajectory parameters are updated is de- 
scribed in some detail in Section [54| If several compatible hits are found, a new candidate is created for each 
of them. In addition, one candidate is created without adding any hit information ("lost hit"). Candidates are 
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No compatible High Chi2 for the trajectory 

measurement after the last measurement 

found: STOP \ V is added: STOP 




Figure 3: Schematic representation of the Kalman Fiher based pattern recognition. The points represent hits, the 
curved lines track candidates and the shaded boxes the size of the search window. 

sorted according to quahty (based on the and the number of hits) and the five best ones are retained for farther 
propagation. As hits are added to the candidate trajectory the knowledge of the track parameters improves and the 
sizes of the search windows decrease. Propagation of a candidate ends if configurable cuts on the number of layers 
or the number of consecutive layers without a hit are exceeded. The way the Kalman filter based CKF works is 
visualized in Figure [3] 

Even if cosmic events are mainly single track events, the pattern recognition is quite difficult. The tracker is 
designed to be hermetic and to offer optimal module overlaps for tracks coming from the interaction point. A 
cosmic ray track can encounter holes as well as zones of high module overlaps. 

The cosmic ray track reconstruction has been particularly useful to test and validate the CKF trajectory builder, 
specifically because it was possible to test its ability to find hits on the overlap regions. When two overlapping 
modules are crossed, it is very important to associate both hits to the candidate trajectory to give a stronger con- 
straint on the track parameters. Cosmics represent a suitable situation to test this, since the fraction of overlaps is 
higher than for tracks coming from the interaction point. 

5.2 Cosmic Track Finder 

The Cosmic Track Finder is designed as a simple and robust algorithm, tailored to the specific task of reconstructing 
single tracks without imposing a region of origin, but assuming a preferred direction. It has been applied with 
success in the CMS magnet test UJ- 

5.2.1 Seed Finding 

For cosmic track reconstruction in TIF data, a dedicated seeding is used. The total number of hits in the whole 
tracking volume is expected to be several orders of magnitude lower than in proton-proton collisions. Hence, all 
the hit pairs which are geometrically compatible are considered as potential seeds. The definition of compatibility 
is the following: 

• The seed is built from a pair of hits lying in different layers (barrel) or wheels (endcaps). In the barrel, 
all the hit pair combinations of either the three outermost TOB layers or of the three innermost TIB layers 
are considered. In the endcap, hit pair combinations are accepted if the hits are separated by at most one 
intermediate wheel. 

• The distance between the modules along the z-axis is less than 30 cm. This cut is not applied for seeds in 
TEC layers. 

• The distance between the hits in global x is less than two times the distance in global y. This requirement 
is motivated by the small angles cosmic tracks are expected to have with respect to the vertical direction. 
Again, this cut is not applied for seeds in TEC layers. 
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The maximum number of seeds is limited to 30. 
5.2.2 Pattern Recognition 

All seeds that fulfill the previous selection criteria are considered in the pattern recognition. For each seed, pre- 
liminary track parameters are calculated based on the line connecting the two hits. The hit-selecting algorithm is 
simplified with respect to the Combinatorial Kalman Filter A seed, which comes from the previous step, can be 
at the top or bottom of the instrumented region of the tracker. If it is at the top (bottom), all the hits with a global 
y coordinate lower (higher) than the hit of the seed are sorted in decreasing (increasing) order with respect to the 
global y axis. A very simple procedure establishes if the hit can be selected or not: 

1. The trajectory is propagated to the surface of the module which provided the hit. The uncertainty from 
multiple scattering is considered when the track is propagated. 

2. The compatibiUty of the hit with the propagated trajectory is evaluated using a estimator. For this analysis 
a cut at < 40 was chosen. 

3. If the hit is compatible, the trajectory is updated with the hit. 

A trajectory candidate is retained if it contains at least 4 hits. The final selection is only done after the full track fit. 

The fitting procedure is the same as for all the other CMS tracking algorithms and is based on the Kalman Filter 
At the end of this phase several trajectories are still valid, but only one is retained since only one track per event is 
expected. The best trajectory is selected on the basis of the criteria below, listed in order of precedence: 

1. the highest number of layers with hits in the trajectory, 

2. the highest number of hits in the trajectory, 

3. the lowest value of the x^. 

5.3 Road Search 

The Road Search (RS) algorithm treats the CMS tracker in terms of rings, where a ring contains all tracker modules 
at a given r-z position, spanning 360° in 0. A track will be a line in r-z, and the Road Search uses pre-defined sets 
of rings consistent with a line in r-z in which it will search for a track. These pre-defined sets of rings are referred 
to as roads. The standard Road Search algorithm, designed for use in proton-proton collisions, had to be slightly 
modified in order to reconstruct cosmic muons tracks. These modifications are described in the following. 

5.3.1 Seed Finding 

For seed finding, the Road Search algorithm uses pairs of hits in seed rings. The CMS tracking system uses three 
different types of hits (depending on silicon module type and arrangement), and all can be used for seeding: rcj), 
stereo, matched. While r0 and stereo hits are located on the respective modules (see Section |2]i, matched hits are 
virtual hits combining a compatible r(f) and stereo hit in the same layer into a precise 3-dimensional measurement. 
All three types of hits can be used in different variations for seeding. The set of rings that composes the road will 
be those consistent with the linear extrapolation between the seed rings in the r-z plane. The seed is composed 
of a pair of hits in the seed rings within a maximum A0, which effectively translates to a cut on the minimum 
transverse momentum of the track. 

In the standard Road Search algorithm, roads are constrained to point back to the luminous region of the beam, 
corresponding to roughly \z\ < 15 cm. Cosmic rays will not point back to z = 0, so for cosmic track reconstruction 
specific roads were generated, where the constraint on the extrapolation of the roads was loosened to include any 
pair of seed rings within the acceptance of the readout detector. An overview of the inner and outer seed rings for 
the TIF geometry is shown in Figure |4] The choice to use the inner layers of both TIB and TOB as inner seed rings 
(in addition to the inner rings of TID and TEC) was made to avoid any geometrical acceptance loss and to have 
one seed ring structure which fits different possible sub-detector readout combinations. 
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Figure 4: Overview of the ring structure of the CMS tracker. The rectangles in the plot represent the rings as 
defined in the Road Search algorithm. The inner seed rings are shown in dark grey (blue), the outer seed rings are 
shown in light grey (red). The seeding is asymmetric since modules located at —z were not part of the cosmic data 
taking. 

5.3.2 Pattern Recognition 

In the first part of the pattern recognition step, an expected trajectory is determined using the two seed hits and - 
for the reconstruction of proton-proton collisions - the beamspot. The trajectory is extrapolated through the other 
rings of the road and hits are collected inside a narrow window around the expected trajectory. This collection of 
hits is referred to as a cloud. The cloud should contain all the hits of a track, along with other hits that happen to 
overlap and lie close to the track. In the second part of pattern recognition, the cloud is turned into a trajectory. 
A trajectory is first built in low occupancy layers, extrapolating inside-out. With the trajectory well-defined from 
the low occupancy layers, hits from the higher occupancy layers are added to the trajectory. The final track will 
contain at most one hit per detector module, though potentially more than one hit per layer due to detector overlaps. 
A schematic presentation of the pattern recognition is shown in Figure |5] 

For cosmic track reconstruction, the following modifications have been implemented: the sorting of the hits within 
a cloud was changed from an inside-out sorting to a sorting in y. In addition, all hits are compared with the 
expected trajectory in r — 0, even for hits in the TID and TEC where the expected trajectory in z — is usually 
used. 



5.4 Track Fitting 

The Kalman Filter is a "dynamic" Least Squares Method lH). At the intersection with a detector surface the 
trajectory is described by a 5-dimensional state vector 

(1) 

V Pz Pz 

whose components are the inverse signed momentum and the angles and positions in two orthogonal directions 
in the local coordinate system. In this system, the normal to the sensor plane defines the z-axis while the j/-axis 
is aligned with the direction of the strips (the average direction in case of trapezoidal modules). In the absence 
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(a) (b) (c) (d) (e) 



Figure 5: Schematic representation of the Road Search algorithm. In (a) a trajectory is drawn through the two 
circled seed hits. All hits within a window around the trajectory (shaded region) are collected in the cloud. In (b) 
a new trajectory is built inside-out using only hits on the low occupancy layers of the cloud, resulting in the fitted 
trajectory in (c). This trajectory is extrapolated back to the innermost layer and then hits on the higher occupancy 
layers are tested (d). The best hit on each layer is used to yield the fitted track (e). 

of a magnetic field the momentum is not measured: | is set to an initial value and is not affected by the fitting 
procedure. 

The Kalman filter proceeds in an iterative way through the list of hits established by the pattern recognition. The 
state vector is initialized with an estimate provided by the pattern recognition. The errors are set to high values in 
order to avoid any bias. Then the following three steps are repeated for each hit in the list: 

1. The state vector and the corresponding covariance matrix are propagated to the reference surface of the hit, 
yielding a prediction for the state at the module. 

2. The hit information is combined with the prediction to form an "updated state". 

3. The of the track is increased based on the compatibility between the predicted state and the hit. 

After the last hit the full information has been included and the best estimate of the track parameters is available at 
the end of the trajectory. 

In order to achieve maximum precision on all the surfaces (particularly the first one), a second fit is run in the 
opposite direction. The updated states of this backward fit are combined with the predicted states of the forward 
fit to obtain the final estimates ("smoothing"). The results of the smoothing can also be used to calculate the 
compatibility between a hit and the combination of forward and backward predicted states, i.e., the full information 
provided by all other hits. 



6 Tracking Performance 
6.1 Track Reconstruction Results 



The cosmic run data have been split into nine different data samples (see Section 3. 1 1. The number of reconstructed 
single track events for all three track reconstruction algorithms - without applying any track quality cuts - is 
presented in Table |4] In contrast to the Cosmic Track Finder, both the CKF and the Road Search algorithm are 
able to reconstruct more than one track per event. Hence, it is expected that the number of single track events will 
be higher for the Cosmic Track Finder in comparison to the other two algorithms. Events with multiple tracks are 
characterized by a larger amount of wrongly reconstructed tracks and require special dedicated studies. 
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To visualize the standard track reconstruction results, the data sample taken at —10° C in trigger configuration C 
using all silicon strip sub-detectors has been chosen. The number of reconstructed tracks for all three tracking 
algorithms is shown in Figure [6] along with various track distributions in single track events. Apart from the 
different numbers of reconstructed tracks, all three tracking algorithms lead to similar results. The (j) distribution 
shows a peak around — 7r/2, compatible with tracks that originate from the top of the detector and travel outside 
in. The rj distribution is compatible with the trigger layout. The number of rcf) and stereo hits per track shows a 
most probable value of 8 hits for all algorithms. The cluster charge distribution shows that no algorithm includes a 
significant number of noise hits. 

Table 4: Number of reconstructed single track events for all three tracking algorithms in the different data samples. 
The significant change observed for data taken at —15° C is due to the smaller number of powered modules (see 
section [6^ . 



Active Detector 


Trigger 


T [° C] 


Number of Single Track Events 




Configuration 




Cosmic Track Finder 


CKF 


Road Search 


TIB+TID+TOB 


A 


+15 


502 505 


493 247 


476949 


TIB+TID+TOB+TEC 


A 


+15 


74116 


71687 


65 521 


TIB+TID+TOB+TEC 


B 


+15 


103 219 


98 085 


92 311 


TIB+TID+TOB+TEC 


C 


+15 


134 844 


132190 


122 528 


TIB+TID+TOB+TEC 


C 


+10 


309 554 


295 288 


273 410 


TIB+TID+TOB+TEC 


C 


-1 


475 467 


452 501 


418 948 


TIB+TID+TOB+TEC 


C 


-10 


528 225 


501 577 


469 619 


TIB+TID+TOB+TEC 


C 


-15 


79406 


16810 


77 181 



The effect of the different trigger scintillator positions on the track reconstruction can be seen in Figure [7] The 
histograms show -q and zo (measured at the point of closest approach to the origin in the transverse plane) of re- 
constructed tracks in single track events. The compared data samples have the same active detector and operating 
temperature, but differ in the position of the trigger scintillation counters as indicated in Figure |2] Trigger config- 
uration A shows a large peak around 77 w 0.1 stemming mostly from TIB+TOB tracks. A second peak is visible 
around 77 w 0.8, formed primarily by tracks with TEC hits. In the zq distribution, the TIB+TOB tracks give rise to 
the main peak at zq ~ 30 cm, while tracks with mostly TEC hits lead to a peak at zq ~ 120 cm. For trigger configu- 
ration B, the lower scintillation counter was moved towards negative z, while the upper scintillation counters were 
moved slightly in the positive z direction. This change led to a significant reduction in the number of TIB+TOB 
tracks contributing to the peak at zo ~ —30 cm. Tracks with TEC hits can mostly be found at zq w 25 cm. Trigger 
configuration C is a combination of configurations A and B using an enhanced trigger setup. All three tracking 
algorithms give similar distributions, and for illustration purposes only results from the Road Search algorithm are 
shown. 

6.2 Stability of Track Reconstruction Results 

Control plots have been generated in order to monitor the stability of the track reconstruction results and to study 
the dependence on the operating temperature as a function of time. Figure [8] shows the distribution of various 
reconstructed track parameters for single track events from all three algorithms. For each cosmic run, the mean 
of the parameter under study has been extracted and is plotted as a function of the run number Each histogram 
indicates the active detectors, the various scintillator positions and the respective operating temperature. 

It is expected that the tracking parameters do not depend on the operating temperature, while a clear dependence on 
the trigger setup and the active detectors should be seen. This can be observed for trigger configuration C, where 
the temperature was gradually decreased from +15° C to —15° C. The reconstructed track parameters do not 
show a dependence on the operating temperature, except for data taken at —15° C. In order to reach this operating 
temperature, a significant amount of modules in all four sub-detectors had to be turned off, effectively changing 
the active detectors. The tracking algorithms were not retuned for this situation. The differences in their seeding 
configurations translate into a different acceptance and explain the variations in the number of reconstructed tracks 
and the track parameter distributions. 

Figure[8](f) shows a dependence of the average cluster charge before gain correction on the operating temperature. 
Details of the effect of the gain corrections are shown in Ref . [2 1 . 

Various subtle changes can be observed for TIB+TOB in trigger configuration A, which is the result of a correction 
of the scintillation counter position. 
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Figure 6: Number of reconstructed tracks and various track distributions in single track events using data taken at 
T — — 10° C in trigger configuration C with TIB+TOB+TEC as active detector. The results of the Cosmic Track 
Finder are shown as solid line, while results from the CKF are shown as a dashed line and results from the Road 
Search algorithm are shown as a dotted line. 
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Figure 7: Comparison of rj and zq in single track events for the three different trigger layouts. All three data 
samples were taken at T = +15° C and are normalized to each other. The tracks are reconstructed with the Road 
Search algorithm; the results for the Cosmic Track Finder and CKF are similar. No quality cuts have been applied. 



6.3 Processing Time 

The processing time of the track reconstruction algorithms has been studied in both data and simulated events. 
Results for the various configurations are shown in Table |5] Due to the low rate of cosmic ray tracks and the 
expected low occupancy in the detector, no dedicated timing optimizations have been performed for any of the 
three tracking algorithms. However, by comparing the reconstruction times of the different algorithms with each 
other it is possible to detect inefficiencies in, for example, the seeding or pattern recognition step. 

The timing results for simulated events show that the fraction of the overall amount of processing time spent for 
the tracking varies between 2.9% for the Cosmic Track Finder to 8.8% for the RS. The individual timing results of 
the algorithms are compatible with each other The Cosmic Track Finder is the fastest of all three reconstruction 
algorithms due to its simplified structure. 

A direct comparison of the timing results for data is difficult, even for runs taken in the same configuration. 
Nevertheless, a comparison allows general conclusions to be drawn. In Table|5]it can be observed that the overall 
processing time per event is significantly higher in virgin raw (VR) mode compared to zero suppressed (ZS) mode, 
except for data taken at —15° C. This behavior is expected since all channels are read out in virgin raw mode and 
the zero suppression is done offline. The comparable reconstruction time for both virgin raw and zero suppressed 
data taken at —15° C is due to the limited detector geometry that is read out in this configuration. Concerning the 
tracking algorithms it can be observed that the processing time for both the CKF and the Road Search algorithm 
are comparable, while the Cosmic Track Finder is significantly faster. This is caused by the more elaborate pattern 
recognition stages of the CKF and Road Search algorithms. 



Table 5: Reconstruction times for various configurations in both data and simulated events. 



Trigger 


T 


Readout 




Time/Event [arbitrary units] 




Conf. 


[° C] 


Mode 


Total 


Road Search 


CKF 


Cosmic T.F. 


Simulated Events 


A 


n/a 


n/a 


0.2093 + 0.0211 


0.0184 + 0.0003 


0.0121 + 0.0001 


0.0067 + 0.0001 


C 


n/a 


n/a 


0.2033 + 0.0197 


0.0179 + 0.0003 


0.0127 + 0.0002 


0.0059 + 0.0001 


Data 


A 


+15 


VR 


2.2524 + 0.0105 


0.0455 + 0.0049 


0.0638 + 0.0035 


0.0131 + 0.0002 


B 


+15 


VR 


2.2641 + 0.0092 


0.0373 + 0.0017 


0.0895 + 0.0066 


0.0105 + 0.0002 


B 


+15 


ZS 


0.1931 + 0.0060 


0.0341 + 0.0014 


0.0744 + 0.0042 


0.0091 + 0.0002 


C 


+15 


VR 


2.3698 + 0.0111 


0.1997 + 0.0048 


0.0569 + 0.0030 


0.0100 + 0.0002 


C 


+10 


VR 


2.2745 + 0.0155 


0.0636 + 0.0116 


0.0701 + 0.0048 


0.0100 + 0.0002 


C 


-1 


VR 


2.3714 + 0.0112 


0.0504 + 0.0072 


0.0604 + 0.0034 


0.0092 + 0.0002 


C 


-10 


VR 


2.5425 + 0.0498 


0.2257 + 0.0479 


0.0576 + 0.0034 


0.0096 + 0.0002 


C 


-10 


ZS 


0.2034 + 0.0090 


0.0331 + 0.0019 


0.0631 + 0.0044 


0.0095 + 0.0002 


C 


-15 


VR 


0.8292 + 0.0174 


0.0637 + 0.0149 


0.0159 + 0.0018 


0.0028 + 0.0001 


C 


-15 


ZS 


0.1062 + 0.0089 


0.0456 + 0.0058 


0.0178 + 0.0016 


0.0026 + 0.0001 
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Figure 8; Control plots showing the mean of various reconstructed track parameters for all three algorithms as a 
function of run number The scintillator positions and the operating temperatures (in ° C) are indicated at the top 
of each frame. All distributions are shown for single track events: (a) number of single track events divided by 
triggered events; (b) number of hits; (c) 77; (d) 0; (e) /ndof; (f) cluster charge in ADC counts of hits belonging 
to the track before gain correction. The differences observed at — 15° C are due to the reduced number of active 
modules. 
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6.4 Tracking Efficiency 



TIF data were used to verify the efficiency of the tracking algorithms. The estimation of the tracking efficiency 
is a challenging task, due to the poor prior constraints available for cosmic tracks and the absence of an external 
reference other than the scintillator counters. The strategy adopted for this analysis is the reconstruction of partial 
tracks, using only a subset of the tracker These track segments serve as a reference for the reconstruction in the 
remaining parts of the tracker. For statistical reasons, tracks in the TIB and TOB were chosen as independent 
subsets of the tracker. In the following discussion, track segments reconstructed in TIB will be referred to as TIB 
tracks, while track segments reconstructed in the TOB are referred to as TOB tracks. 

To isolate single track events, only events with less than 30 reconstructed hits were analyzed. TIB and TOB tracks 
were accepted if the normalized was smaller than 30 and if they contained hits with signal-to-noise ratio greater 
than 8 in at least four different layers. 

Based on the selected events, the tracking efficiency in TIB (TOB) was calculated from the fraction of TOB 
(TIB) tracks with a matching track in the other sub-detector. This conditional efficiency can deviate from the 
global efficiency if the TIB and TOB acceptance differ after selection. The match between tracks was based on 
a comparison of the azimuthal angles. The difference was required to be smaller than five times the resolution 
determined from simulation. In addition, the reference track had to contain at least two hits in stereo layers and the 
extrapolation had to be fully contained in the active region of the other sub-detector. 



Table 6: Average conditional track efficiencies and corresponding statistical uncertainties for all three track recon- 
struction algorithms in data and Monte Carlo simulation. 





e(TIBf 
Data 


FOB) [%] 
MC 


e(TOB 
Data 


TIB) [%] 
MC 


Combinatorial Kalman Filter 
Cosmic Track Finder 
Road Search 


94.0 ±0.2 

93.1 ±0.2 
89.9 ±0.2 


98.66 ±0.04 
94.46 ± 0.09 
89.08 ±0.12 


97.7±0.1 
96.9 ±0.1 
99.0 ±0.1 


98.76 ±0.04 
97.36 ±0.06 
99.39 ±0.03 



The average efficiencies for all three track reconstruction algorithms are shown in Table |6] The results have been 
obtained using data taken at +10° C, —1° C and —10° C in scintillator position C. The corresponding values for 
simulated events are listed in the same table. The stability of the conditional track reconstruction efficiency is 
demonstrated in Figure |9] which shows the distribution of e(TIB|TOB) and e(TOB|TIB) for the CKF as a function 
of run number. 



The dependence of the conditional track reconstruction efficiency on the pseudorapidity is shown in Figure 10 for 
both data and simulated events. As an example the distributions from the Combinatorial Kalman Filter have been 
chosen. Possible reasons for the slightly higher efficiency found in simulation are differences of the acceptance in 
data and simulation and of the resolution on the track angle, which enters via the match of TIB and TOB tracks. 



The difference in acceptance is confirmed by the angular distributions of the reference tracks, shown in Figure 1 1 



6.5 Momentum Estimation from Scattering 

Since the TIF cosmic data were taken without a magnetic field, it is difficult to select a precise, i.e. high- 
momentum, track sample to be used for detailed track studies. The majority of tracks are low momentum and 
dominated by multiple scattering. In the CMS tracker, the material contribution for each layer is x/Xq « 2.5%. 
Only for momentum above about 20 GeV/c does the multiple scattering contribution fall below the intrinsic reso- 
lution. 

The multiple scattering contribution can be used in order to obtain a measure of the momentum of each track. This 
is done by refitting the tracks with the momentum floating. The scattering contribution to the resolution of each 
layer is calculated from the assumed momentum and the projection distance between layers, and the momentum 
that makes the track fit equal to one is used as the best momentum estimate. 

Since the scattering is a statistical process, this estimate does not provide a precise measure of the momentum of 
each track. However, the ensemble of tracks provides a measure of the momentum spectrum that can be compared 



to simulation. In Figure 12 (a) the momentum distribution, calculated as described above, for simulated cosmic ray 
muons with momenta in three different ranges are compared. Though there is large spread, the estimate follows 
the true momentum, and the low and high momentum extremes are well separable. 
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Figure 9: Conditional track reconstruction efficiencies e(TIB|TOB) (triangles) and e(TOB|TIB) (circles) for the 
CKF as a function of the run number. 
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Figure 10: Conditional track reconstruction efficiency e(TIB|TOB) (left) and e(TOB|TIB) (right) for the CKF in 
data as a function of the pseudorapidity (r/). Downward and upward pointing triangles indicate data and simulated 
events, respectively. 
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Figure 11: Distributions of (j> (left) and rj (right) of reference tracks in the TOB. Data and simulated events are 
indicated by black and red lines, respectively. Distributions from the CKF algorithm are shown. 
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This momentum estimate was performed in data using tracks reconstructed in the TOB. The TOB ahgnment 
corrections provide single layer hit resolutions of about 40 /im, which is small compared to the scattering effects 



over most of the momentum range. The results are shown in Figure 12 (b) and compared to the momentum 



spectrum obtained from simulated data. There is generally good agreement. 

This momentum estimate was done to validate the tracking and resolution modeling. It also provides a track quality 
measure, which could be used in other studies, equivalent to a momentum cut in the absence of a magnetic field. 
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Figure 12: (a) The "scattering-estimated" momentum spectrum is compared for TOB tracks in simulated cosmic 
ray muons with three different momentum ranges, < p < 0.5 GeV/c (dashed line), 2 < p < 3 GeV/c (dotted 
line), and p > 20 GeV/c (full line). The spread of the momentum estimate is large, but the low and high momen- 
tum extremes are well separable, (b) The "scattering-estimated" momentum spectrum is compared for TOB tracks 
in post- alignment cosmic data (full line) and simulation (dashed line). Reasonable agreement between simulation 
and data indicates that the scattering and intrinsic resolution functions are well described. The last bin contains 
overflows, i.e., tracks for which no scattering contribution is needed to obtain a good fit. 



6.6 Energy Loss Measurement for Cosmic Muons 

The signal height from a strip is related to the number of electron-hole pairs created by the traversing particle in 
the bulk of the silicon sensor The analogue readout of the CMS tracking system retains this information, and each 
reconstructed hit is subsequently associated with a corresponding cluster charge in ADC counts. The measurement 
of the energy lost per unit pathlength (dE/dx) by charged particles in silicon, in conjunction with the measurement 
of their momentum, is a valuable tool for particle identification. The dE/dx of the track can provide additional 
information for the identification of electrons in jets - complementing the information from the electromagnetic 
calorimeter - and most importantly allows the discrimination between different charged hadron species. 

Each track crosses several micro-strip modules, and each crossing gives an independent dE/dx measurement 
based on the deposited energy normalized to the estimated pathlength in the active part of the module. Since the 
deposited energy is measured in ADC counts, a conversion factor has to be applied. In the following the default 
value of 250 electron-hole pairs per ADC count is used for the silicon strip tracker. This is multiplied by the energy 
needed to create an electron-hole pair, 3.61 eV. The conversion includes also a correction for the variations in the 
electronic gain. The precision of these correction factors has a direct impact on the homogeneity of the measured 
signal and, hence, on the resolution. 

Given the trajectory of a charged particle it is possible to measure the energy deposited in the different layers of 
the tracking system and to attribute a single dE/dx estimator to the track. The loss of energy after each traversal 
is negligible with respect to the momenta considered, justifying the assumption of a unique dE/dx value along 
all the trajectory of the particle. Various most-probable-value estimators for the track dE/dx have been studied in 
simulated events (see Ref. |||9J), and the two most promising methods are studied using data; 

• Truncated mean 
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The highest values (in a fixed fraction of the total number of observations) are discarded, and the arithmetic 
mean is computed for the remaining values. The method has little sensitivity to outliers, but it introduces a 
bias which depends on the arbitrary cut-off parameter (truncation fraction) and it reduces the statistics. 

• Generalized mean 

The generalized mean with exponent A: of a variable x is defined as: 

\ i=l 

A special case of the generalized mean with fc = — 1 is the harmonic mean. The harmonic mean of a set of 
observations is given by the number of terms divided by the sum of the reciprocals of the term. 



J. / h. 



Based on the results of the energy loss study in Ref. ||9l, a truncation fraction of 20% and 40% and the exponent 
k = —2 and k = —4 have been chosen for comparison. 

The dE/dx study is performed using data taken at —10° C in trigger configuration C. This data sample is close to 
the nominal tracker operation temperature and has large statistics. The following quality requirements are imposed; 



• Exactly one track reconstructed in the event. 

• Number of hits > 4 (with matched hits being resolved into their rcf) and stereo component and counted 
separately). 

• x^/ndof < 10. 



Assuming that the different track reconstruction algorithms reconstruct the same track with exactly the same hits 
associated, the measured dE/dx should be identical and independent of the chosen tracking algorithm. This has 
been confirmed by comparing events where both the CKF and the Road Search algorithm find a track that satisfies 
the above Usted quality requirements. The resulting dE/dx distributions with the corresponding means and widths 
are found to be exactly identical for both algorithms. Hence, to eliminate any dependence on the choice of tracking 
algorithm for this study, only events where both algorithms find the same track are retained for the analysis. 



Results for the two types of dE/dx estimators are shown in Figure 13 Mean values and resolutions have been ex- 
tracted from the measured dE/dx distributions by fits to a Gaussian distribution, and the good agreement indicates 
that the fake rate from misidentified tracks or hits is low. The calculated dE/dx resolutions are shown in Table [t] 
It follows that the generalized mean estimator with k — —4 gives the best dE/dx resolution of all four estimators, 
and that a truncation fraction of 40% results in a better resolution for the truncated mean than a truncation fraction 
of 20%. All these results are consistent with the study on simulated events in Ref. ||9l . 



Table 7: dE/dx resolutions for different estimators in TIF data taken at —10° C. 



dE / dx Estimator 


Resolution ([%]) 


Truncated mean (20%) 
Truncated mean (40%) 
Generalized mean (k — —2) 
Generalized mean (k — —4)) 


9.78±0.07 
9.18±0.07 
9.51±0.07 
9.01±0.07 



It is expected that the dE/dx resolution improves with an increasing number of track hits. This has been verified 
in data for both the generalized mean and the truncated mean by measuring dE/dx in samples with a different 
requirement on the number of hits per track. Three samples were studied: (i) 5< #hits < 6, (ii) 8< #hits < 10, 
(iii) #hits >15. Figure [T4| shows the dE/dx for the truncated mean in all three samples. The resolutions are 
summarized in Table [s] The dE/dx resolution improves as expected with increasing number of track hits. The 
dE/dx resolution obtained in the sample requiring at least 15 hits is consistent with the value obtained using 
simulated muon events in Ref. |[9l . 

Measurements at different temperatures are reported in Table |9] The mean dE/dx shows a slight increase with 
increasing operating temperature which is attributed to a residual miscalibration due to changes in the operating 
point of the readout electronics. 
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Figure 13: Results for different track dE/dx estimators for TIF data taken at —10° C: truncated mean with a 
truncation fraction of 20% (left); generalized mean with exponent k = —2 (right). 



7 Validation of Track Reconstruction 

7.1 Reconstructed Track Position at Scintillation Counters 

For the visual validation of the tracking performance, a set of pseudo hit-maps for the trigger scintillation counters 
has been compiled. Two reference planes, coplanar to the global xz-plane, at y = 160 cm and y = —160 cm 
respectively, were used. For every reconstructed track, the trajectory state at the outermost hit was propagated to 
the upper reference plane using a straight-line track-model. Similarly, the trajectory states at the innermost hits 
were propagated the same way to the lower reference plane. Figure [15] shows the resulting intersection points at 
the upper plane. 

The shapes of the six scintillation counters can be clearly distinguished. Additionally their roughly determined po- 
sitions and dimensions are in good agreement with the experimental setup. Two of the counters show a drastically 



decreasing light-yield for positions further away from the photocathode, as seen in Figure 15 (a) and (b). This is 
most likely due to damages of the devices themselves, rather than from any tracking inefficiencies. The vertical 
lines at constant z positions stem from barrel tracks that are built only out of rcf) hits. Due to missing z information 
from stereo modules these tracks are accumulated at specific positions. 



7.2 Comparison of Virgin Raw and Zero Suppressed Data 

The vast majority of the TIF data have been taken in virgin raw mode, using specific code to do the zero suppression 
offline. However, over 100k events have been collected in zero suppression mode, and this data have been used to 
compare the tracking results from both readout modes. A detailed comparison of virgin raw and zero suppressed 
data on the cluster reconstruction level can be found in Ref. 12J. 

The complete zero suppressed data set has been taken at room temperature with scintillator position A. Only TIB 
and TOB have been included in the readout. The tracking results in the zero suppressed data set are compared to 
results that are obtained in a virgin raw data set taken under the same conditions. The results of the comparison are 



shown in Figure 16 for the Road Search algorithm. Both the Cosmic Track Finder and the CKF give similar results 
and are not shown here. The two data sets were normalized by using the single track events. A global normalization 
factor has been derived by comparing the number of single track events in both samples. The normalization factor 
has been applied to all the distributions. The track parameters are only shown for single track events. A rather 
good agreement between the tracking results from zero suppressed and virgin raw data is observed. There are small 
discrepancies which can be attributed to the reduced resolution in the digitization in zero suppressed mode. This 
reduction in the dynamic range per strip generates a spike at 255 ADC counts for high-charge single strip clusters, 
which is observable in the cluster charge distribution. 
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Figure 14: Energy loss estimation as a function of the number of hits per track in TIF data taken at —10° C. The 
histograms show the truncated mean with a truncation fraction of 40%. Tracks are required to have 5 or 6 hits (a), 
between 8 and 10 (b), or at least 15 hits (c). 



Table 8: dE/dx resolution as a function of the number of hits per track in TIF data taken at —10° C. 



Number of Hits 


Resolution [%] 




Truncated Mean 


Generalized Mean 


5 or 6 


11.58±0.21 


11.20±0.22 


8 to 10 


9.29±0.10 


9.04±0.10 


> 15 


6.35±0.17 


6.25±0.17 



Table 9: dE/dx measurements and resolutions for data taken at +10° C, —1° C and —10° C in trigger configura- 
tion C. The results are based on the the truncated mean method with a truncation fraction of 40%. 



Temperature [° C] 


dE/dx [MeV/cm] 


Resolution [%] 


-10 


2.596+0.002 


9.18+0.07 


-1 


2.643+0.002 


9.45+0.06 


+10 


2.684+0.002 


9.35+0.06 
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Figure 15: Reconstructed track position at the surface defined by the upper scintillation counters in position C: (a) 
scintillation counters for the barrel readout; (b) scintillation counters for the end-cap readout. 



7.3 Regional Unpacking 

In order to speed up reconstruction for the High-Level Trigger ifTOll . a new software scheme has been developed. 
The new implementation of the cluster reconstruction improves the performance significantly by combining the 
unpacking of the raw data and the cluster finding into a single step. Both unpacking and cluster reconstruction can 
now be done at the request of the pattern recognition. Hence, only a fraction of the tracker has to be unpacked, in 
contrast to the currently deployed scheme where the full tracker data has to be converted for local reconstruction. 

The new scheme was tested on data from one run with unpacking and cluster reconstruction triggered by the pattern 
recognition (On Demand Mode) and with unpacking and cluster reconstruction of all tracker data [Global Mode). 
For both tests the improved cluster reconstruction code has been deployed. The CKF algorithm has been chosen for 
track reconstruction. The resulting tracks were compared with tracks produced using the standard reconstruction 
and found to be identical. 



Table 10 shows the processing time for the different track reconstruction steps in several configurations. All three 
configurations use the standard seeding configuration (see Table |3]l, and the full tracker data is unpacked. In the 
first case the standard cluster reconstruction algorithm has been used, resulting in a cluster container which is used 
for the subsequent seed finding. In the second case the improved cluster reconstruction algorithm is deployed, 
but a standard cluster container is built so that the standard track reconstruction configuration can be used. In the 
third case the unpacking and clusterization of the seeding layers is done in the seeding step, while the remainder 
is done during the pattern recognition. This avoids the use of the standard cluster container. Comparison of the 
time to produce clusters in the first two configurations shows a large improvement due to the improved cluster 
reconstruction algorithm (by a factor of 7.7). Comparing the overall track reconstruction time for the first two 
configurations to the third shows a further improvement. 



Table 10: Processing time for the different steps involved in local and track reconstruction. The standard CKF 
seeding is used. 







Time [arbitrary units] 




Standard 


New Cluster Reconstruction 


New Cluster Reconstruction 






(using standard cluster container) 




Unpacking & Cluster Reconstruction 


209.7±1.7 


27.1±0.009 




Seed Finding 


6.5±1.3 


5.6±0.8 


33.4±0.2 


Pattern Recognition 


13.4±1.5 


14.2±2.0 


11.6±1.9 


Track Fitting 


1.8±0.5 


1.8±0.5 


1.8±0.5 


Total 


218.5±1.5 


42.2±1.1 


39.9±1.1 



The standard seeding configuration uses a large fraction of the silicon strip tracker to seed (see Table [5]). Hence, 
all these layers have to be unpacked for the seed finding. In order to study the performance of the new software 
scheme in on demand mode, the seeding region was restricted to the outer three layers of TOB. This necessarily 
leads to a reduction in the number of reconstructed tracks, and thus the timing results can not be compared to those 



22 



CO 90000 r 

gj 

■g 80000 : 

LU 

70000 : 
60000 : 
60000 E 
40000 E 
30000 " 
20000 E 

10000 E 

= 



- Virgin Raw 

■ Zero Suppressed 



Virgin Raw 
Entries 133933 
Mean 0.7861 
RMS 0.6732 

Zero Suppressed 



Entries 

Mean 

RMS 



123636 
0.8068 
0.6896 



7 8 9 

#Tracks 



LU 



12000 
10000 
8000 
6000 
4000 
2000 




10 



Virgin Raw 

Zero Suppressed 

Virgin Raw 

Entries 91697 
Mean 8.784 
RMS 2.369 

Zero Suppressed 



Entries 

Mean 

RMS 



85901 
8.79 
2.378 



1 



15 



20 



#Hits 



- Virgin Raw 

■ Zero Suppressed 



Virgin Raw 



Entries 91697 
Mean 0.1311 
RMS 0.1499 

Zero Suppressed 
Entries 85901 
Mean 0.1311 
RMS 0.1491 



24000 ^ 

22000 

20000 

18000 

16000 

14000 

12000 

10000 

8000 

6000 

4000 

2000 ~- 

o4r 



Jl 



-2 -1 



^ Virgin Raw 

- - Zero Suppressed 



Virgin Raw 



Entries 

Mean 

RMS 



91697 
-1.539 

0.1229 



Zero Suppressed 
Entries 85901 
Mean -1 .538 

RMS 0.1246 




Figure 16: Comparison of tracking results obtained using data taken in zero suppressed mode and in virgin raw 
mode: number of tracks (top left), number of hits per track (top right), rj (middle left), (f) (middle right), x^/ndof 
(bottom left) and cluster charge (bottom right). As an example the distributions from the Road Search algorithm 
are shown - the other algorithms show the same characteristics. The spike in the cluster charge distribution for 
zero suppressed mode is due to the limited dynamic range for a single channel. 
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obtained using standard seeding. Table 1 1 shows the processing time for two configurations that use the modified 
seeding layout. In the first configuration the unpacking of the three seeding layers is performed in the seed finding 
step, while the remaining layers are all unpacked in the pattern recognition step. In the second configuration the 
remainder of the strip tracker is only unpacked when requested by the pattern recognition. Comparison of these two 
configurations shows a further improvement in the track reconstruction when unneeded regions are not unpacked 
(by a factor of nearly 3). 



Table 1 1 : Processing time for the different steps involved in local and track reconstruction. The seeding region 
was restricted to the outer three layers of the TOB in order to compare the on demand unpacking to the global 
unpacking in the pattern recognition step. 





Time [ai 
Global Mode 


■bitrary units] 
On Demand Mode 


Seed Finding 
Pattern Recognition 
Track Fitting 


11.3±0.6 
30.9±0.8 
0.9±0.2 


11.5±0.6 
10.8±1.0 
0.9±0.2 


Total 


42.7±1.3 


21.6±1.5 



In some events large numbers of seeds are found. This leads to a substantial increase in tracking time due to 
the large number of combinations. In order to study the trend in unpacking time, these events were excluded by 
requiring events to have less than 3 tracks and less than 100 clusters. Figure [T7| shows the total time taken to 
reconstruct tracks from raw data as a function of the fraction of the silicon strip tracker which has to be unpacked. 
The lower limit is due to the three TOB layers unpacked for seeding, where no seeds have been found. There 
appears to be a linear relationship between this fraction and the time. 
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Figure 17: Total reconstruction time (in arbitrary units) from raw data to tracks versus the fraction of the active 
silicon strip detector being unpacked. 



8 Hit Reconstruction Efficiency and Resolution 

8.1 Hit Finding Efficiency 

The hit reconstruction efficiency for modules was estimated using the capacity of the Kalman fitter to provide 
optimal predictions of the track parameters on all surfaces crossed by a track. For the pattern recognition part the 
CKF was used and the analysis was performed for all modules of a given layer at a time. To avoid any bias due to 
correlations between hit and track finding efficiencies, none of the hits in the layer under consideration were used 
in seeding or pattern recognition. The CKF would still record intersections with modules on the tested layer as 



"lost" hits. The choice of seeding layers is summarized in Table 12 
A sample of well-reconstructed events was selected by requiring: 

• Exactly one track reconstructed by the CKF. 
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Table 12: Seeding layers used for different layers under test. 



Processed Layers 


Seeding Layers 


TIB 2,3,4 
TOB 1,2,3 
TOB 4,5 


TOB 4,5,6 
TIB 1,2,3 or TOB 4,5,6 
TIB 1,2,3 



Table 13: Width of the margins used to define the fiducial area in different sub-detectors, 
deduced from the size of the sensitive volume as described in the tracker geometry. 



The margins were 





Module Edges 


Inter-sensor Gap 




local X 


local y 


local y 


TIB 


3 mm 


5 mm 




TOB 


5 mm 


10 mm 


± 10 mm 



• One hit in the first TIB layer and one hit in one of the two last TOB layers. 

• At least four reconstructed hits, with at least three matched hits from stereo layers. 

• At most five "lost" hits and no more than three consecutive ones. 



These cuts reject cosmic showers, define the acceptance, ensure a correct measurement of the track in both the r-<f> 
and r-z planes and compensate for the removal of the hits in the layer under test. 

The efficiency for a module was measured by asking for an intersection with the interpolated track and by checking 
for the presence of a hit. For increased robustness in the presence of misalignment the distance between the hit 
and the predicted track position was not used in the selection. An upper cut of 30° on the angle of incidence of the 
track with respect to the normal to the module plane, applied in TIB layer 2, selected topologies similar to the ones 
expected from proton-proton collisions. 

In order to avoid artificial inefficiencies at the edge of the sensitive region and in the central gap of double-sensor 



modules a fiducial area was defined (Table 13 i. Intersections that were either inside the margin or passing closer 
than five times the predicted error from the module edge or the central gap were excluded from the analysis, as 
were known inactive modules. After these cuts the number of intersections per layer is of the order of 15000, both 



at room temperature and at —10° C. Results for runs at room temperature are shown in Figure 18 Runs taken at 
— 10° C show almost identical efficiencies. 




Figure 18: Summary of the layer efficiency at room temperature. 

The efficiency exceeds 99.8% for all measured layers. More detailed results are reported in Ref. The method 
has been validated on simulated events and data. In simulation it yields efficiencies compatible with 100%, as 
expected. In data an artificial inefficiency of 5% was generated by randomly removing hits from the third TIB and 
the fourth TOB layer. The algorithm perfectly reproduces this inefficiency. 
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8.2 Hit Resolution 



The hit resolution was studied by considering reconstructed tracks without using the detector layer under study 
in the estimation of the track parameters. To minimize the effects of any potential misalignment and reduce the 
effects from the need to extrapolate track position estimates between detector layers events were selected with two 
consecutive hits in the same layer. Use of these so-called overlap hits also serves to reduce the potential amount of 
detector material between hits. The "backward (forward) predicted" track parameters using layers beyond (before) 



the layer under study were combined to make a prediction of the track position and direction as shown in Figure 19 
The difference between the local x-positions on the two modules in the overlap layer are compared for the actual 
detector hits and the prediction. This procedure minimizes the uncertainty on the predicted position by accounting 
for correlations between the positions in the two modules which otherwise would dominate the measurement. 
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Figure 19: Schematic of an overlapping pair of modules where the forward and backward predicted parameters are 
combined to make the best estimate of the track without the overlap layer. Xhit and Xpred are measured in local x 
coordinates, i.e., perpendicular to the strips, with Axhu = Xhui — Xhit2 and Axpred = Xpredi — Xpred2- 

The sample of events is selected from all the data collected at scintillator position C requiring: 

• Exactly one track reconstructed by the CKF. 

• At least six reconstructed hits in the barrel region only. 

• A normalized < 30. 



Additionally, all overlap pairs collected from the data must have an error on Axpred < 50/xm. 

In the TIB layers, the mean uncertainty for the predicted position difference for tracks passing through overlapping 
modules (< cr( Axpred) >) is found to be w 40 /im after cuts, while the mean uncertainty for the track hit position 
difference (< (t( Ax^^j) >) is much smaller w 20/im. In these layers it is thus not possible to extract a meaningful 
test of the accuracy of the hit position error estimates due to the large uncertainties in the predicted positions. 

In the TOB layers, however, the situation is reversed and the < a{Axpred) > are smaller than the < a{Axfiit) >■ 
The hit position uncertainties are calculated theoretically from the detector geometry. The accuracy of these hit 
position uncertainties is compared with the resolution as measured in the data by finding the fitted width of the 
distribution of the hit position difference between the two modules in the overlap region compared to the predicted 
position difference for each track. 

The fitted width of Axhu — Axpred and the values of < a{Axhit) > and < a{Axpred) > for each overlap 



position are shown in Figure 20 The width should be a combination of the uncertainty due to the predicted 
position difference and the hit position difference. In general the fitted width is close to the quadratic sum of 
the expected uncertainties on predicted and hit positions. Outliers are due to residual misalignment between the 
modules in some of the pairs. 



In Figure 20 within each layer, the overlap pairs are ordered by decreasing track angle with respect to the detector 
surface normal. As the tracks approach normal incidence in the plane normal to the strips < a{Axhit) > gets 
larger and < a{Axpred) > gets smaller giving better precision on the uncertainty due to hit resolution. This 
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Figure 20: Fitted width of the distribution of the difference between the measured and the expected a; -position 
differences (circles), the mean uncertainty of the predicted position difference (upward pointing triangles) and the 
hit position difference (downward pointing triangles) for overlap hits in the data. Each bin represents one module 
pair The uncertainties for the measured width are extracted from the fit to a Gaussian distribution. The RMS 
values for a{Axhit) and a{Axpred) are of the order of a few /xm. 



dependence of the resolution on the local track angle is illustrated in Figure 21 for data and simulated events. Only 
the module type with the highest statistics and sensitivity to the hit resolution is shown, corresponding to overlaps 
in TOB layers 1 to 4. 
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Figure 21: Fitted width of the distribution of Axhu — Axpred and the mean uncertainties for Aa 



and Axhit 



for TOB layers 1 to 4 as a function of local dx/dz for data (left) and simulation (right). The symbols are the same 



as in Figure 20 



To quantify the accuracy of the hit position uncertainties, a scale factor, a, is used on (t{Ax hit)- In each overlapping 
pair of modules the variable a is fit according to 

= a^ < a{Ax,M) >^ + < a{Axpred) >^ (2) 

where w is the fitted width of Ax^it — Axpred- With an accurate understanding of the hit position uncertainties, 
the value of a should be consistent with one. The best precision for a is reached at normal incidence, while for 
shallow angles the uncertainties on the prediction dominate over the hit resolution. 

The values of are found to be dependent on the angle at which the track hits the detector surface as measured 
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by dx/dz in local coordinates. Figure 22 shows the values of as a function of \dx/dz\ for TOB layers 1 to 4 in 
data and simulation. An estimate of a^t dx/dz = is extracted using a linear fit at low values of \dx/dz\. 

The average momentum of cosmic rays is low and <j{Axpred) is dominated by multiple scattering. Since the mo- 
mentum is not measured for individual tracks a systematic term is added. The width of the momentum distribution 
in simulation is translated into an uncertainty of 45% on (j{Axpred)- The calculation of is repeated, varying 
a{/S.Xpred) by ±45%. Half of the difference is attributed as a systematic error, which is the dominant uncertainty. 
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Figure 22: Values of per module pair as a function of local \dx/dz\ for TOB layers 1 to 4 in data (left) and 
simulation (right). A linear fit is used to estimate the value of at dx/dz — 0. 

In conclusion, the hit resolution is well understood in the inner TOB layers with the best precision at small angles. 
The resolution of a particular module is dependent on the track angle. The uncertainties on the width of the double 
difference ^Xhu — ^Xpred range from 20 — 60/im. A scaling factor for the nominal hit resolution was determined 
as a function of the angle of incidence. For normal incidence, and averaging over the first four TOB layers, a 
scaling factor a — 0.98 ± 0.04 was found for a nominal resolution of « 65^m. The error includes statistical 
uncertainties and the systematic effects due to the unknown momentum. 



9 Conclusions 

A large sample of over 4.7 million cosmic ray events was recorded at the CMS Tracker Integration Facility (TIF) in 
the period from November 2006 to July 2007. These data, taken under various running conditions, were success- 
fully processed by the tracker offline software. This allowed for a validation of the local (module-level) and global 
(track) reconstruction algorithms under realistic conditions, using detector condition, alignment and caUbration 
information derived from data. 

Compared to collision data, track reconstruction at the TIF profits from the low occupancy in cosmic events. On the 
other hand, the task is more complex due to the absence of constraints on the origin of the tracks. Track momenta 
are low and cannot be measured directly in the absence of a magnetic field. Three different algorithms were used 
to reconstruct tracks: one dedicated cosmic track finder and the two standard modules designed for LHC collision 
data. The latter ones were reconfigured in order to optimize for low multiplicity cosmic events. 

All three algorithms proved to be stable in different running conditions. The results allow to assess the tracker 
performance at the hit and track level. Reconstructed tracks have been used to verify the full efficiency of hit 
reconstruction at the module level and to provide a first measurement of the hit resolution using overlapping 
modules within a layer The efficiency exceeds 99.8% for all barrel layers. At normal incidence the hit resolution 
in the TOB was found to be compatible with the expected one. The track reconstruction efficiency was measured 
by comparing reconstructed segments in different parts of the detector. Relative efficiencies were found to be in the 
range 90 to 99%, depending on the reconstruction algorithm and the subdetector. The momentum spectrum could 
be derived from the effects of multiple Coulomb scattering, and the measurement of the energy loss confirmed the 
resolution of about 9% expected from Monte Carlo simulation. 

The results presented in this note constitute an important step in the commissioning of the tracker offline software 
and in the preparation for the LHC data taking. Data taken at the TIF allowed for the first time to confirm sim- 
ulation results on the performance of the tracker and the reconstruction programs on a larger scale. The special 
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configuration of the track reconstruction algorithms developed for TIF data will be reused in the next stage of 
the commissioning process, i.e., for cosmic ray data taken in situ with the full CMS detector. Also the analysis 
procedures will continue to be used for the measurement and monitoring of the tracker performance. 
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